(Peptide) FASTA of: gi-50913505.pep from: 1 to: 1036 September 15, 2004 18:46

gi|50913505|ref|YP_059477.1| Collagen adhesion protein [Streptococcus pyogenes MGAS10394]

TO: *.pep Sequences: 56 Symbols: 22,803 Word Size: 2

Scoring matrix: GenRunData:blosum50.cmp
Variable pamfactor used

Gap creation penalty: 12 Gap extension penalty: 2

Histogram Key:
Each histogram symbol represents 1 search set sequences
z-scores computed from opt scores

Joining threshold: 39, opt. threshold: 27, opt. width: 16, reg.-scaled



The best scores are: init1 initn opt z-sc E(54)..

/home/morama/gas/pili/align/gi-50913505.pep Begin: 1 End: 1036
! gi|50913505|ref|YP_059477.1| Collag... 6697 6697 6697 3452.1 9.6e-189
/home/morama/gas/pili/align/gi-19224141.pep Begin: 48 End: 144
! gi|19224141|gb|AAL86412.1|AF447492_... 63 100 159 105.9 0.023
/home/morama/gas/pili/align/gi-21909640.pep Begin: 147 End: 449
! gi|21909640|ref|NP_663908.1| protei... 35 35 136 96.3 0.08
/home/morama/gas/pili/align/gi-13621428.pep Begin: 57 End: 318
! gi|13621428|gb|AAK33238.1| hypothet... 33 33 91 75.6 1.1
/home/morama/gas/pili/align/gi-50913506.pep Begin: 33 End: 428
! gi|50913506|ref|YP_059478.1| Fimbri... 70 149 86 71.3 1.9
/home/morama/gas/pili/align/gi-13621432.pep Begin: 14 End: 56
! gi|13621432|gb|AAK33241.1| conserve... 40 65 78 68.0 2.9
/home/morama/gas/pili/align/gi-19745301.pep Begin: 241 End: 466
! gi|19745301|ref|NP_606437.1| putati... 52 52 73 64.8 4.3
/home/morama/gas/pili/align/gas15.pep Begin: 492 End: 739
! GAS15 GAS15 43 68 69 61.4 6.6
/home/morama/gas/pili/align/gi-21909636.pep Begin: 176 End: 298
! gi|21909636|ref|NP_663904.1| conser... 31 31 62 60.8 7.1
/home/morama/gas/pili/align/gi-28810259.pep Begin: 181 End: 303
! gi|28810259|dbj|BAC63197.1| hypothe... 31 31 62 60.7 7.2
/home/morama/gas/pili/align/gi-19224139.pep Begin: 90 End: 143
! gi|19224139|gb|AAL86410.1|AF447492_... 43 43 54 58.9 8.9
/home/morama/gas/pili/align/gi-19745305.pep Begin: 96 End: 149
! gi|19745305|ref|NP_606441.1| hypoth... 43 43 54 58.8 9
/home/morama/gas/pili/align/orf82.pep Begin: 125 End: 178
! TRANSLATE of: orf82.seq check: 4296.. 43 43 54 58.2 9.6
/home/morama/gas/pili/align/gi-21909638.pep Begin: 84 End: 137
! gi|21909638|ref|NP_663906.1| hypoth... 43 43 52 58.0 9.9

\\End of List

gi-50913505.pep

/home/morama/gas/pili/align/gi-50913505.pep

gi|50913505|ref|YP_059477.1| Collagen adhesion protein [Streptococcus pyogenes MGAS10394]

SCORES Init1: 6697 Initn: 6697 Opt: 6697 z-score: 3452.1 E(): 9.6e-189
>>/home/morama/gas/pili/align/gi-50913505.pep (1036 aa)

initn: 6697 init1: 6697 opt: 6697 Z-score: 3452.1 expect(): 9.6e-189

Smith-Waterman score: 6697; 100.0% identity in 1036 aa overlap 

(1-1036:1-1036) 
QEGSKZ.AIDEFKARYPNI.SIYSIXSVSKDINSDTASSSPWLKYI.SGEEHYYGITDTAELE

AiAUJ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 ii i it 1 1 1 1 1 1 1 1 1 1 
gi-50913505. DIIDKWFTPKTTSQPKGKVTI/TFKSDYKVDDEYT^^

7^0 800 810 820 830 840 

850 860 870 880 890 900 

gx-50913505. EMGDSDTDYGTNQTSSGKGGLPSNSDASVHYiyiADGREQKIjPYKHPVI 

rf ™ n , n , iiiilii 1 1 1 1 ' 1 1 * ' i » U « I * I II II I J 1 1 1 1 M J M I Ml 1 1 III II 1 1 II ! I M 1 1 

gi -50913505. EMGDSDTDYGTNQTSSGKGGLPSNSDAST^^ 

85 ° 860 870 880 890 900 

. . •: . 910 - 520 930 940 950 960 

gx -5 0913505. ADl^QKKLAGVEFELI^EDKXIV^ 

1 1 1 1 II 1 1 III I II 1 1 1 1 1 1 1 1 1 1 1 1 i II III Mil) || i 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 

gx— 50913505 . ADISJNQE^CLAGVEFELRKEDKKIVWEKGTT 

910 920 930 940- 950 960 

970 980 990 1000 1010 1020 

gx-50913505 . ENPWETVAVANJSTGbiK^KHPIEGE 

• MM 1 1 II 1 1 1 1 II I II 1 1 1 1 1 1 II I Ml III II II Mill III M M Ml M II M M I 

gx-50913505 . ENPWEVAVAltfNGDIl^^ 

970 980 990 1000 . 1010 1020 

1030 

gi-50913505. TVALLFYRRQHRKKQY 

MI II II III Mill I 

gx-50913505. TVALLFYRRQHRiOCQY 

1030 

/home/morama/gas/pili/align/gi-19224141.pep

gi|19224141|gb|AAL86412.1|AF447492_9 protein F2 [Streptococcus pyogenes]

SCORES Init1: 63 Initn: 100 Opt: 159 z-score: 105.9 E(): 0.023
>>/home/morama/gas/pili/align/gi-19224141.pep (1161 aa)

initn: 100 init1: 63 opt: 159 Z-score: 105.9 expect(): 0.023
Smith-Waterman score: 159; 36.7% identity in 98 aa overlap

(895-990:48-144)

870 880 890 900 910 920 

gi -5 0913505. SDASVETYMA2X3REQKLPYCT 

: | : | : | : I j | : j J : : j | 

gi-1 9224141. FII^IxLLWIGIiSGVSVGHAETRNGA^ 

20 30 40 50 60 70 

930 940 950 960 970 980

gi-5 0913 5 05 - WERGTTGSNGQMFKYLQKG^^ 

• : ,::::| - : 11 Ml I i I || I hi =|| | |::| :| 
gi-19224141 . VQTFTSISTDKGIVDAQNLQPG-TYTIjKEETA 

80 90 100 110. 120 130 

990 1000 1010 1020 1030 

gi-50913505 . ELKSKIKSSYMIKNYKIYQLiPSSGGRGSQ I FX IVGSMTATVALIjFYRKQHRKKQY 

h II II 

gi-1922414i . E 1 1 SKAG S KDVS S S LQ LENPKMSWS KYGKTEVS SG AADF YKNHAAY F KMS FELKQ KDKS 

140 150 160 170 180 190 

gi-50913505. pep 

/home/morama/gas/pili/align/gi-21909640.pep

gi|21909640|ref|NP_663908.1| protein F2 like fibronectin-binding protein [Streptococcus pyogenes MGAS315]

SCORES Init1: 35 Initn: 35 Opt: 136 z-score: 96.3 E(): 0.08
>>/home/morama/gas/pili/align/gi-21909640.pep (656 aa)

initn: 35 init1: 35 opt: 136 Z-score: 96.3 expect(): 0.08
Smith-Waterman score: 148; 24.5% identity in 339 aa overlap 

(686-1005:147-449) 

660- 670 680 690 700 710 
gi-50913505 . m^RSQEGSKXiAIDEFKARYPNIiSIYSLGVSKDINSDTASSSPVVLr KYIiSGEEHYY 

:|| |: : :|:: : | ||:: 

cri -21909640. TNLQAVJSVEPVXESLPWTSLKPIAQKDITAK^^ 

120 130 140 150 160 170 

720 730 740 750 760 770 

gi-50913505 . GITDTAELEKTLNKIVEDS^ 

: I III— sis :::::: : :| \ | : : : : : : | : : | : | 

gi-21909640 . -AVDDAEEKQ INSEGQQEISVTWT-NQLVT — DEKGMAYIYSWEVDKNGEIiliEP 

180 190 200 210 220 

780 790 800 810 820 - 830 

gi-50913505 . KDQVQ — EAGKDI IDKVVFTPKTTSQPKGKVTIiTFKSDYKA/DDEYTYTIiSFNVKASDEAY^ 

II :: I I * : ,1 I II : * :|| : : z\ | : = : :::: 
gi -.21909.640 . KDYIKKEDGLTVTNTYV KPTSG-HYDIEVTFGNGH-ID — ITEDTTPDUVSGEN 

230 240 250 "■ - 260 270 



850 :.860 
^1-5:093,3505:; ao&tiBNfi&R&S^^ 

... v / : If! IC I IV t\ jte'.zizi f: ; fc *V V . / 

gi^2l909.64;0-. QRfeQ$EGEDSKPI©EyTE — :NijEr£FGi^TMP^ 

- — - • . .... 3^0 /' : 320 ' : • 3 30 ' .V . 



r * • • 



880. .890 900 910 920 • .930 . 

! gi-50913505 EQK^ 

I! - . . * ' * hi I != hllU :IH : I t *\ I=Ji = : , 
gi -21909 640 J EQGQSGDMTIEEDSATHIKFSKRDIDG : — KELAGATMELRDSSGKTI STWISDGQVK 

340. 350* 360 370' 380 

• . m • 

940 950 960 970 980 990 

gi-50913505 . FKYLrQKGKTYYLYETKAK^ 

II: III : II I II |||:| : :: s = * h |s I 

gi-21909640 . DFYLMPGK-YTFVETAAPDGY EVATAITFTVNEQGQVTVNGKA.TKGDAHIVMV 

390 400 410 420 430 440 

1000 1010 1020 1030 

gi-50913505 . KXYQIiPSSGGRGSQIFIIVGSOTAWALIjFYRRQHRKKQY 

h |::|r . 

gi-21909640 . DAYK-PTKGSGQVXDIEEKLPDEQGHSGSTTEIEDSKSSDVTIGGQGEWDTTEDTQSGM 

450 460 470 480 490 500 



gi-50913505.pep

/home/morama/gas/pili/align/gi-13621428.pep

gi|13621428|gb|AAK33238.1| hypothetical protein [Streptococcus pyogenes]

SCORES Init1: 33 Initn: 33 Opt: 91 z-score: 75.6 E(): 1.1
>>/home/morama/gas/pili/align/gi-13621428.pep (340 aa)

initn: 33 init1: 33 opt: 91 Z-score: 75.6 expect(): 1.1
Smith-Waterman score: 95; 19.9% identity in 271 aa overlap

(568-819:57-318)

540 550 560 570 580 590 

gi-50913505 . NKLSVIGFQGSADYHAGKX^ 

: | 5 : : : | : : : || : | : : | | 

gi-13621428 VNGAKI/TVTKNIjDIjVNSNi^ 

30 40 50 60 70 80 i 

600 ,610 = 620 630 640 650 
gi-50913505 . HHNGTNYHAALLIQiK^ ~SGHGS 

-1*1 =:i : :|: I | III-. : I : : : 

gi-13621428 . DKGGSOTKTAEFDFSEVTFEKFGVYYYKOT^ 

90 100 110 120 130 140 

. 660 * 670 680 6.90 70.6 . 

gi-^5 0913505. SlTORNim'RSQEGSKtiAro 

= = ss -III Is "J :JI :h *| 1*1 h ' ■:: 

gi-13621428 . KPVATYIVGYKEGSKVPI-QFKN SLDSTTLTVKKKVSGTCGDRSJmFNFGnTLKAlTQ 

150 160 170 180 190 200 

710 720 730 740 750 
gi- 50913505. YYGITDTAELEKTLN KXVEDSKLSQL GISDS LSQ YVDYYDKQPDVLVT R 

M = .= — III - M | |.. | :f I 

gi-13621428 . Y^£KASRYJJMX1&K^ 

.210 220 • 230 ' .240 250 260 

760... 770 . . 780 790 800 810 

gi-50913505.. KSKMTOETEILYQK^ 

gi-13621428 . SEKYn?TNX^ 



gx-JL J.b^ SEKY3r[EN^/EyS 



...... : 

* ,.* ■■ % 



gi-50913505. SEli*VP^I>E&^ 

^ * o , , . - 4 

• • r . •..:**; ...... 

.^i-13621428. AJP^^^ - *■ - •/.**<«:.•.* 

. 320. 330 * . .340 . . ' . . 

• •• 

gi-50913505.pep

/home/morama/gas/pili/align/gi-50913506.pep

gi|50913506|ref|YP_059478.1| Fimbrial structural subunit [Streptococcus pyogenes MGAS10394]

SCORES Init1: 70 Initn: 149 Opt: 86 z-score: 71.3 E(): 1.9
>>/home/morama/gas/pili/align/gi-50913506.pep (556 aa)

initn: 149 init1: 70 opt: 86 Z-score: 71.3 expect(): 1.9
Smith-Waterman score: 120; 21.5% identity in 469 aa overlap
(503-966:33-428)

480 490 500 510 520 
gi-50913 505 . YSQWVYHGTFDYSSYQGESFimGQIHYRYRGIVSVSDGIRRDDAVKNSLLGVNG L 

I : : II : : : | | : : : : : : : : | I 

ga-50913506 . N RRE T VREKI L I T AKKLMLAC LAI L AVVGLGMT RVS - ALSKDDTAQ LKI TNI EGG P TVT L 

10 20 30 40 50 60- 

530 540 550 560 .570 580 

gi-50913505 . IiQRFVNINPEJSTEQQjSVIGFQGSADYHAGKW^ 

:: 1 I hi- I I s :| : :: f: :: z\ z\ |:: 

gi-50913506 . YKIGEGVYNTNGDSFIKFK : YAEGVSLTE0X3PTSQEIT-TIANGINTGKIKPFSTEN 

70 80 90 100 110 

590 600 6i0 620 630 640 

gi-50913505. LLDPOTI/TALHNNGTira 

: I - - h? : I I I IN |::: : 1 11 : : 

gr-50913506. VSISNGTATYNARGASVYIALLTGAT r — DGRTYNPILLAAS YKGEGNLVTK 

120 13 0 140 150 160 

650 660 670 680 690 700 

gi-50913505. NGSSJNTORNrsTVTRSQEGSKXiAI ' 

I : l = = l -I — I I: h:= h hi- hi : J-* hi 

gr-50913506 . NIDS — KSNYLYGQ — TSVA KSSLPSITfCK^mSTIDDVl^ 

170 180 190 200 210 

710 720 730 740 750. 760 

gi-50913505. EHYYGITBTAELEKTLiraiVEDSKLSQ 

\ J-. I --ll I :||"h s s : I h|: : || 

gi-50913506. LPSY ; TKEAVNKTVY VSDNMSEGDTF — NFNSLTVEWKGKMANITE 

220 230 240. 250 260 

* * * 

770 780 790 800 810 820 

gi-50913505. ILYQKDQVQEAGKDIIDKWFTPKTTSQP^^ 

II-:! =h h h ::| |: 
gi-50913506. -IX5SVMVmrTKIGIAKEVlNlN — LESISPN1— 

270 280 290 . 

830 .840 850 860 870 . 880 

gi-^50913505. AYEKYKDISfiSGKYSEMGDSOT^ . 

•I s -I- -I H r Jh :| | -i : :i| :| j- 

gi-50913506.. ^Y^VtfNtf&A^ ^ V 

360 .;■ - t \!Kttr.- 



: • 920 . . 930;. • • -;. : 

; . ■- J I 4 : I \'J J - - I 1 I : j : : : | | : : J | 1;. : | | | 

.gi-50913506. IVYa^Q^J^RKVDS-VSKTPMG 

- - ' ... 3eo ' • .,?70 . 380 . ; . w .390.. ..-. 400 ' /^viV-^i^^s^'ia^^tf 

- .950 960 ' 97.0; . , 980 990 . 1000 " ' "V 

gi-50913505. LYETKAKX/5YTLPENPWEVAVAHNGDIKV^ 

s I - 1 I" I "I " I. ... 
-gi-50913506. IKELKAPKGYSIJSTTETYEITANXWTATVKTSANS 

410 420 - 430 440" 450 460 

gi-50913505.pep

/home/morama/gas/pili/align/gi-13621432.pep

gi|13621432|gb|AAK33241.1| conserved hypothetical protein [Streptococcus pyogene

FIGURE S7F
SCORES Init1: 40 Initn: 65 Opt: 78 z-score: 68.0 E(): 2.9
>>/home/morama/gas/pili/align/gi-13621432.pep (450 aa)

initn: 65 init1: 40 opt: 78 Z-score: 68.0 expect(): 2.9
Smith-Waterman score: 78; 37.0% identity in 46 aa overlap 

(368-411:14-56) 

340 350 360 370 380 390 . 

•gi-50913505 . KGHNNTINIY IRSTRPLGIiKPIGAAPALIQ PRSFRSLT PRSTKMK — RSAPVTSKFEGELE 

! I s = i hi : HI || 

gi-13 621432 . MTRTI^QKKRMTC P VETED I T YRRKKI KGRRQ AI LAQ FEPELV 

- 10 20 30 40 

400 410 420 430 440 450 

gi-5.0913505 . HHKRIDYLGDNQlSnsrPOTTIDD 

ih I Ih : II 

g-i-13621432 . HHEDI GDSCTCPIX:HGTLTEXGSWQRQELWIPAQLKRIJ^^QHAYKCQT.CSDNSL 

50 60 70 80 - 90 100 

gi-50913505.pep

/home/morama/gas/pili/align/gi-19745301.pep

gi|19745301|ref|NP_606437.1| putative collagen binding protein [Streptococcus pyogenes MGAS8232]

SCORES Init1: 52 Initn: 52 Opt: 73 z-score: 64.8 E(): 4.3
>>/home/morama/gas/pili/align/gi-19745301.pep (524 aa)

initn: 52 init1: 52 opt: 73 Z-score: 64.8 expect(): 4.3
Smith-Waterman score: 95; 23.7% identity in 245 aa overlap

(759-989:241-466) 

730 740 750 760 770 780 

gi-50913505 . DSKXiSQLGISDSLSQYVDYYDKQPDYXiVTRKSK^/Isro 

~**\\\ H 1 Mil H h:: 
gi-197 45301. ETIDPDFNEGKEIK^THIMADLFSYAKNPRASTNDE — LLSQVKKVX.EKGYRI3 — DSTT 

" 220 230 240 250 2 60 

• • 790 . 800 f • ~/>- •iB10'>;--H^' 820' • 830 • 840 

HSIEGRYSEM 
I I : 

gi-19745301. -YA^ 

>^3d6' -310 3Si0 




„. 86p \.v ; :870; §80 :89.0; . 

gi-50913505 . ■ GBSDEDY—^GO^^ 

* : K h ::|: H .! v ! I M • = : |! I : H • J Ill U 

cO;i -197 45 3 01 • . Nl SI^DFYVPltfSNKYQS^ - 

330 340 . 35b / . 3^0 370 '* 380 

900 910 . .920 930 940 ** 950 . 

•gi-50913505 . K?i/I3ADNNQKKLAGVEFELRK1EDKK 

II =11 v h:h= I : HI H UH =11 l=. I : = : : 

gi-197 4 5301. AD — KKKEFNFEIHLKSSDGQ AI — SGTYPTNSGELT VTDGKATFTDKDGESLI 

390 400 410 420 430 

960 970 980 990 1000 .1010 

gi-50913505 . GYTLPEN— PWEVAVAl^GDIKVKHPIEGELK 

II =■ s|» H H= :=|: - I |: 

gx-197453 01 . VEGLPSGYSYEITETGASDYEVS — WGK-NAPDGiCATKAS VKEDETITFENRKPIiVPPO? 
450 460 470 480 490



1020 1030 
gi-50913505 . VG SMT AT VALLF YRRQHRKKQ Y 

gi-19745301 . GLTTIXSAXYLWIiliLLVIil^IiWVWIilGRKGLKNI) 

500 510 520 



gi-50913505.pep

/home/morama/gas/pili/align/gas15.pep
GAS15 GAS15

SCORES Init1: 43 Initn: 68 Opt: 69 z-score: 61.4 E(): 6.6
>>/home/morama/gas/pili/align/gas15.pep (762 aa)

initn: 68 init1: 43 opt: 69 Z-score: 61.4 expect(): 6.6
Smith-Waterman score: 100; 21.4% identity in 252 aa overlap

(641-873:492-739)



620 . 630. 640 650 660 

gi-50913505 . AKEII^STEVICDDGRRKIMIFI SDGVPTFYFGEIXSYRSGNGSSJroRlOTVTRSQ — EGSKLAI 

| (||. : J : : : : : : | : : | : : : I I | 

gas 15 . pep HIAGRDIiFKYTVKPRDTDPDTFIi^^ 

470 480 490 500 510 520 

670 680 690 700 710 - 720 

gi-50913505 . DEF--K^YPI^SIYSLGVSK^^ 

I ; 1 : : • : = | : | : : | : : : : : : | : '7 i : | T : : II 

-gas 1 5 . pep Y^TD'SAELpKDKIiKDYHGFG 

530 540 550 560 5.70 



730 740 750 760 770

gi-50913505 . VEDSK^SQLGISDSLSQYVDYYDKQPDVLVT RKSKVN DETEIIiYQKDQVQEA 

•1 - ::| : : : ||: : || ||: :: |:[: ;: : :::: 

gaslS . pep QSLiIGTQWHPEDLVDIIRMEDKKEV^ 

580 590 . ' 600 610 620 630 

780 790 800 810 820 830 

gi-50913505 . GIODIXDKWFTPKTTSQPK-GKVTI^ 

' :: •"• ■ ■^iMilW*-^ -tf'h'.'i • I- n =i i i r 

gaslS.pep : .- ™ — 

r "\ iy §&0ir"' r-— 650 . tm . 670 



-690 

i*5.0 ■ . 860 . ..:87d 0 -.. 880: 890 

gi r 5 09 13 50 5 . \G^YSE^DSl^DY<31}NQT - -'SSdKXJGLfcSife^^ 

• ... > ^ : ! I ■ :V:| .-:•: >j ; :' J , f£] " • ^ 

•gaslS .pep rQEV^AWSKTGITSI^ 

700 / \ 710 720 730 740 " 750 

'■ ..... .* • ,\ - ' • - 

- " ' ■ : ; ; , SOO . 910 - 920 . 930 940 950 

-gi-50913505 .• VPI^FTKVDADltfNQI^ 



gas 15. pep 



XRKHD 
760 



gi-50913505.pep

/home/morama/gas/pili/align/gi-21909636.pep

gi|21909636|ref|NP_663904.1| conserved hypothetical protein [Streptococcus pyogenes MGAS315]

SCORES Init1: 31 Initn: 31 Opt: 62 z-score: 60.8 E(): 7.1
>>/home/morama/gas/pili/align/gi-21909636.pep (344 aa)

initn: 31 init1: 31 opt: 62 Z-score: 60.8 expect(): 7.1
Smith-Waterman score: 71; 22.9% identity in 131 aa overlap 

(181-305:176-298) 

160 170 180 190 200 210 

gi - 50913505. DI TTTIiTFKNGLiSIjEG ASTE1ANDPJSTVRVG IVNPiroTVQT I TPTIKQ DAIX?K^/K1^VFTGR 

II |:: 4 : =| ||: 

•gr-21909 63 6 , VDVYVGNTCEGGGF E PKF I VSKEQ GTDVKK P VNFNNS FAT T S LKVKK3WS GN~ TGE 

150 160 170 180 190 

220 230 240 250 260 

gi-50913 505 . LGKQ VTIVSTTRLKEEQTISIiDSYGELvIDGAVGLSQKXJRPPYSKPITVNILKPKIi 

I h : = r| =i|:i |: :: :| | : :: | :: | : 

gi -21909636. tiQKEFDFTLTLNESTNFKKDQI VSLQK-GlSTEKFEVKIGTPYKFKLKNGESIQLDKIiPVGI 
. 200 210 220 230 240 250 

270 280 290 300 310 320 

- gi-50913505 . SS I ESSLDSKI>FE I VKTIDNLYTWDDQ — F YIAvDFISKQYEVLKTDYQ SAKDSTPQTRDI 

. = = — * y II =1 I I s | ||: : I I 
gi-21909 63 6 . TYKVNEMEANK-IX5YKTTASLKEGI^ 

260 270 280 290 300 310 

* 

330 340 350 360 370 380 

gi-50913 505 . . hFGE YTVTSPLVMNKGHNNTIN^ 

gi -219 09 63 6.. VGT L APF AVDS I VAI GG VI YI T KRKKA 

320 330 34.0 

gi-50913505.pep

/home/morama/gas/pili/align/gi-28810259.pep

gi|28810259|dbj|BAC63197.1| hypothetical protein [Streptococcus pyogenes SSI-1]

SCORES Init1: 31 Initn: 31 Opt: 62 z-score: 60.7 E(): 7.2
>>/home/morama/gas/pili/align/gi-28810259.pep (349 aa)

initn: 31 init1: 31 opt: 62 Z-score: 60.7 expect(): 7.2
Smith-Waterman score: 71; 22.9% identity in 131 aa overlap

(181-305:181-303)

. * 3.50 , i^0 ./i' * 180 . .1^0, 200 : .-21% '* • 

gi-50913 5'05; ^DirraTi^^ 

■ Ivl* I : . : . -'|,\ * ■ :■ | : .: : ••■•|< » ■ ° *,..•• . J*'!; - *" •■.*.■ 
gi-28810259. *VDVW<3l!JKEX^FEP^±VS — ^OJ&E- • 

. . 160 170 ^L80 190 . 200 

* ' . ■ ...;..-* 

220 ; 230- ; .240 250 .260 / 

gi-50913505 . LGKQ VIWSTTJ^KEEQTISIjbSYG^^ 

| | : : : | . z \zz | z \ \ zz | : : : : | \ : : : \ zz | : 

gi-28810259 . IiQKEFDFTIiTUiJESTIWKKDQr^^ 

* * 210 ' 220 ' 230 240 250 260 

270 280 290 300 310 320 

gi-50913 505 . SS I ESSLDSKDFEIVKTIDNL.YTWDDQ — F YLLDF I SKQ YEVLKTDYQ SAKDSTPQTRDI 

= = . * 11=1 II :| || = * - I I 

gi -28810259. TYKV1TEMEANK-DGYKTTASLKEGEX3Q SKMYQLDMEQKTDESADEIVVIWRDTQVPTGV 

270 280 290 300 310 320 
330 340 350 360 370 380

gi-50913505 . IiFGEYTVEPLVMNK^^ 

gi-28810259. VGTLAJPFAVLSIVAIGGVIYITKRKKA 

330 340 

gi-50913505.pep

/home/morama/gas/pili/align/gi-19224139.pep

gi|19224139|gb|AAL86410.1|AF447492_7 unknown [Streptococcus pyogenes]

SCORES Init1: 43 Initn: 43 Opt: 54 z-score: 58.9 E(): 8.9
>>/home/morama/gas/pili/align/gi-19224139.pep (189 aa)

initn: 43 init1: 43 opt: 54 Z-score: 58.9 expect(): 8.9
Smith-Waterman score: 54; 31.6% identity in 57 aa overlap 

(742-796:90-143) 

720 730 740 750 760 . 

gi-50913505 . GXTDTAELEKTIJ^IVEDSKX.SQLGISDSLSQYVDYYDKQP DVX.VTRKSKVNDETEIL. 

II II til! : II I 

gx-19224139 . ITIAGSGKASFSPLTFTTOGQYTYRVYQKPSQNKDYQADTTOTDVLVYV TYDEDGTL 

60 70 80 90 100 110 

770 780 790 800 .810 820 

gi-50913505 . YQKDQVQEAGKDIlBKVvl?TPKTTSQP 

I ^IJ : • —I II :| 

gi-19224139 - VAKVT SRKAGDEEKSAITFKPKRIiVKPIPPRQPNIPKTPrjPLAGEVKSX^IX?XLS IVLLGLr 

120 130 140 150 160 170 

gi-50913505.pep

/home/morama/gas/pili/align/gi-19745305.pep

gi|19745305|ref|NP_606441.1| hypothetical protein [Streptococcus pyogenes MGAS8232]

SCORES Init1: 43 Initn: 43 Opt: 54 z-score: 58.8 E(): 9
>>/home/morama/gas/pili/align/gi-19745305.pep (195 aa)

initn: 43 init1: 43 opt: 54 Z-score: 58.8 expect(): 9
Smith-Waterman score: 54; 31.6% identity in 57 aa overlap
(742-796:96-149)



* 



gi-^09i3:50S^ ^ 

goi-l 97453 0 5 v I^IAGSGp^SBLra^ 

70 . 80 90 . i00 - iiO 120 


. 770 -780 „. 790 . . . . 800 . 8±0 ; . . 820 

gi-50913505 . YQKDQyQteGKDIiDKVWTPKTTSQPKG 

I ::H : = *H II vl 
gi-197.45305 . YAKVISRRAGDEEKSAITra 

130 140 150 160 170 180 

gi-50913505.pep

/home/morama/gas/pili/align/orf82.pep

TRANSLATE of: orf82.seq check: 4296 from: 1 to: 672
generated symbols 1 to: 224.

GETSEQ from morama, September 13, 2004 17:09.

SCORES Init1: 43 Initn: 43 Opt: 54 z-score: 58.2 E(): 9.6

>>/home/morama/gas/pili/align/orf82.pep (224 aa)

initn: 43 init1: 43 opt: 54 Z-score: 58.2 expect(): 9.6

Smith-Waterman score: 54; 31.6% identity in 57 aa overlap

(742-796:125-178)

720 730 740 750 760 

gi-50913505 . GITOTAELEKTIJSKIVEDSKLSQ^ DVLVTRKSKVNDETEIIi 

11 II mi =111 

orf 82 . pep ITIAGSGKASFSPIjTFTTVGQYTYRvYQKPSQISr^^ TYDEDGTL 

100 110 120 130 140 150 

770 780 790 800 810 820 

gi-50913505 . YQKDQVQEAGF£>IIDKVWTPKTTSQ 

I ::|| : : ;:| li =| 

orf 82 .pep VAKVI SRRAGDEEKSAI TFKPKRIjVKPI PPRQPNIPKTPIiPL^AGEVKSIjI^IIjSIvXrljGLi 

160 170 " 180 190 200 210 

gi-50913505.pep

/home/morama/gas/pili/align/gi-21909638.pep

gi|21909638|ref|NP_663906.1| hypothetical protein [Streptococcus pyogenes MGAS315]

SCORES Init1: 43 Initn: 43 Opt: 52 z-score: 58.0 E(): 9.9
>>/home/morama/gas/pili/align/gi-21909638.pep (183 aa)

initn: 43 init1: 43 opt: 52 Z-score: 58.0 expect(): 9.9
Smith-Waterman score: 52; 31.6% identity in 57 aa overlap

(742-796:84-137)

720 730 740 750 760 
gi-50913505. GITDTAELEKTIiNKIvEDSKIiSQIjGISDSIiSQYVDTYDKQP DVIiVTRKSKVWDETEIIj 

II II llll : II I 

gx-21909 638 . I T I AGSGKASFS PLTFTTVGQYT YRVYQK^ — r - TYDEDGTL 

60 70 80 90 100 110 



770 - 780 790 . .800- : ,810 - . 820 

gi-5 0 9 13 5 05 , YQIDDQVQI^GKDIIDKVVF^ 

1 ••«■*• r I'* * . .***.*'•' "i II . *.i ••**... ■ . * m .1 • .. 
gi r-219 09:63^ ; VafcvM^ 






170 



Distributed over 1 thread.

Start time: Wed Sep 15 18:46:18 2004
Completion time: Wed Sep 15 18:46:21 2004

CPU time used:

Database scan: 0:00:00.1
Post-scan processing: 0:00:02.2
Total CPU time: 0:00:02.3
Output File: gi-50913505.fasta
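
Note: each hit in a report of this kind carries a one-line SCORES summary, which lends itself to machine reading. The following is a minimal Python sketch, not part of the FASTA program or its output, for pulling the five values out of such a line. The names FastaScores and parse_scores are hypothetical, and the pattern assumes the field order shown in this report; it also accepts the "Initl" spelling that scanning sometimes produces for "Init1".

```python
import re
from typing import NamedTuple, Optional


class FastaScores(NamedTuple):
    """Values from one SCORES summary line (hypothetical container)."""
    init1: int
    initn: int
    opt: int
    z_score: float
    e_value: float


# Assumed layout, taken from the report text:
# "SCORES Init1: 63 Initn: 100 Opt: 159 z-score: 105.9 E(): 0.023"
_SCORES_RE = re.compile(
    r"SCORES\s+Init[1l]:\s*(\d+)\s+Initn:\s*(\d+)\s+Opt:\s*(\d+)\s+"
    r"z-score:\s*([\d.]+)\s+E\(\):\s*([\d.eE+-]+)"
)


def parse_scores(line: str) -> Optional[FastaScores]:
    """Return the parsed scores, or None if the line is not a SCORES line."""
    m = _SCORES_RE.search(line)
    if m is None:
        return None
    init1, initn, opt = (int(g) for g in m.group(1, 2, 3))
    return FastaScores(init1, initn, opt, float(m.group(4)), float(m.group(5)))


if __name__ == "__main__":
    line = "SCORES Init1: 63 Initn: 100 Opt: 159 z-score: 105.9 E(): 0.023"
    print(parse_scores(line))
```

Lines that OCR has damaged beyond this pattern simply return None, so a caller can flag them for manual review rather than guess at the values.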

(Peptide) FASTA of: gi-50913506.pep from: 1 to: 556 September 15, 2004 18:45

gi|50913506|ref|YP_059478.1| Fimbrial structural subunit [Streptococcus pyogenes MGAS10394]

TO: *.pep Sequences: 56 Symbols: 22,803 Word Size: 2

Scoring matrix: GenRunData:blosum50.cmp
Variable pamfactor used

Gap creation penalty: 12 Gap extension penalty: 2

Histogram Key:
Each histogram symbol represents 1 search set sequences
z-scores computed from opt scores

Joining threshold: 37, opt. threshold: 25, opt. width: 16, reg.-scaled

The best scores are: init1 initn opt z-sc E(55)..

/home/morama/gas/pili/align/gi-50913506.pep Begin: 1 End: 556
! gi|50913506|ref|YP_059478.1| Fimbri... 3454 3454 3454 1016.2 4.7e-53
/home/morama/gas/pili/align/orf84.pep Begin: 316 End: 567
! TRANSLATE of: orf84.seq check: 7868 57 83 135 75.1 1.2
/home/morama/gas/pili/align/gi-19745307.pep Begin: 316 End: 567
! gi|19745307|ref|NP_606443.1| protei... 57 83 135 75.1 1.2
/home/morama/gas/pili/align/gi-21909640.pep Begin: 202 End: 524
! gi|21909640|ref|NP_663908.1| protei... 56 81 134 75.0 1.2
/home/morama/gas/pili/align/gi-28810263.pep Begin: 316 End: 638
! gi|28810263|dbj|BAC63201.1| protein... 56 82 134 74.7 1.3
/home/morama/gas/pili/align/orf80.pep Begin: 49 End: 352
! TRANSLATE of: orf80.seq check: 9824.... 45 69 113 70.8 2.1
/home/morama/gas/pili/align/gi-19224137.pep Begin: 25 End: 342
! gi|19224137|gb|AAL86408.1|AF447492_... 45 69 109 69.8 2.4
/home/morama/gas/pili/align/gi-19224141.pep Begin: 277 End: 645
! gi|19224141|gb|AAL86412.1|AF447492_... 73 73 118 68.9 2.7
/home/morama/gas/pili/align/gi-21909636.pep Begin: 44 End: 344
! gi|21909636|ref|NP_663904.1| conser... 45 98 96 66.1 3.8
/home/morama/gas/pili/align/gi-28810259.pep Begin: 49 End: 349
! gi|28810259|dbj|BAC63197.1| hypothe... 45 98 96 66.0 3.8
/home/morama/gas/pili/align/gas15.pep Begin: 222 End: 470
! GAS15 GAS15 42 68 96 63.8 5
/home/morama/gas/pili/align/gi-13621428.pep Begin: 17 End: 340
! gi|13621428|gb|AAK33238.1| hypothet... 41 41 87 63.6 5.2

./hbme/moraina/^ : Begin: 193 End: 462 

;.«" 'gi j 1.9224135 :j:g^ | AAL86406^ J^447^2_. ^V"' V- 41 .41 . 86 .61.0. . 7. 

./h6^e\/mora^ /BegihV 503 End: §66 

>? gi J5031^0:5:|:r^| t^0§9#7 j i ,| ;^oiiag; ;'£. } J ■ 70 , \* J&S y>-j&6 ; v : 60. .a., . . .7.8 

Begin: 60 End: 143 : -/... *. 

i ^^42^ 43 67 67 59.~2 " 8-7 

'vSoiae/^ Begin: 44 End: "344 

* g£ 1.19745303 ^ 10£ 69 5814 9;5' 

\\End of List



gi-50913506.pep

/home/morama/gas/pili/align/gi-50913506.pep

gi|50913506|ref|YP_059478.1| Fimbrial structural subunit [Streptococcus pyogenes MGAS10394]

SCORES Init1: 3454 Initn: 3454 Opt: 3454 z-score: 1016.2 E(): 4.7e-53
>>/home/morama/gas/pili/align/gi-50913506.pep (556 aa)

initn: 3454 init1: 3454 opt: 3454 Z-score: 1016.2 expect(): 4.7e-53
Smith-Waterman score: 3454; 100.0% identity in 556 aa overlap

(1-556:1-556)



10 20 30 40 50 60 

gi-50913506 . MTNRRETVREKI LITAKJRXiMIjAC IJAIIlAVVGIiG^^^RVS ALSKDDT AQIjKITNI EGGPTVT 

liiiiiiiiii.iiiiiiiiiiiiiiiiiiiiiiiuiiiiriiiiiiiiiiiiiiiiiii 

gi-50913506. JXPTlsrRRETVREKILiITAK^KLMIjAC LAI LAVVGLGMTRVS ALSKDDT AQ LKITNI EGGPT VT 

10 20 30 40 50 60 

70 80 90 100 .110 120 

gi-50913506 . LYKIGEGVYNTNGDSFiDreiCYA^ 

1 1 1 ! 1 1 1 1 1 1 1 1 1 ! ! I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

gi-50913506 . KTKIGEGVYNTNGDSFIlSnFKYAE^ 

70 80 90 100 110 120 

* * 

130 140 150 160 170 180 

gi-50913506. NGT AT YNARG AS VYI ALLTG ATDGRT YNP I LLAASYNGEGI^VTKNIDSKSlSri'XjYGQTS V 

l!!lll!l!lil!illllllliliil!ll!liilll!!lllll.lllllitllll|lllll 

gi-50913506. NGTATYNARGASVYIALLTGATIX^RTYN 

130 .140 150 160 170 180 

190 , 200 210 220 . 230 240 

gi-50913506 . AKSSLPSITKKOTGTIDDVKKKT^ 

I I I I I I 1 I i I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 1 I I I I I I 

gi-50913506 . AKSSLPSITKXVTGTIDDVNKKTTSL^ 

190 200 210 220 230 240 

250 260 270 280 290 300 

gi-50913506 . TFNFNSLTVEWKGKMANIT^ 

Nil Ml IN II f 1 1 1 1 1 J 1 1 1 1 1 1 1 J 1 1 1 II 1 1 1 1 1 1 J 1 4. II 1 1 1 1 J II I J 1 1 1 1 1 1 1 1 

gi-50913506 . TFNFNSLTVEWKGI34AN 

250 260 270 280 290 300 



310 320 -330 340 350 360 

gi-5 0 913506. AVVNlSfiCAX VGEEGNPISTECAE I T SKEDS KI VYT YQ ± A 

N 1 1 II I II 1 1 1 II II I N I II I i I II II 1 1 1 1 1 II I II I II 1 1 1 II II 1 1 II II 1 1 ! 1 1 

gi-50913506. AVVNNKAXVGEEGNPNKAEFFYSI^ 

310 320 330 340 350 360 

- 

370 380. 390. 400 410 420 

gi-5 091350 6 . FRKTOSVSKTPIilGAOIFGVYOT 

Ill-Ill II I III II 1 1 II 111111 N II IN Nil 1 1 1 II NII1 III Nil N I III 1 1 

', <^i -545 913506.. FRKVDS VSK^.PLIGAXFGWDTS^ STQVSSGKYKlKiSijKAPKGYS 

370 380. 390 400 .410 420 

r « -^30 . - - .44Q ; %450 ^ 460 .... . . .470 . ...480: 

v v V.-;..vv mkV\iW&ii&-&iffiMWfflM ill ill fMfffi f-k u HxMiM* 

gi^5q.9i3506- p?TE3?YEITAf^ - 

' >v 430 440 4:50 * 460 • 470 480 

.-'••*'•..-** ~ ■ 

- 

* •* * * 

. 490 .500 510 . . 520 .530 . 540 

-gi-50913506. DVIQSAYIESTKAIiTIXIJTT . 

: 1 1 II II I II I N II I UN 1 11 N N III I III 1 1 1 11 1 1 III N N I N 1 1 1 i 11 1 Ml 

gi-50913506 . IWKEAYIESTKAMIXSTTF^ 

490 500 510 520" 530 540 

. 550 • 
gi-50913506. AMIGAIGIYXVKRRKA 

INN II I II Nllll 

gi-50913506. AMIGAXGIYIVKRRKA 

550 
gi-50913506.pep

/home/morama/gas/pili/align/orf84.pep

TRANSLATE of: orf84.seq check: 7868 from: 1 to: 2088
generated symbols 1 to: 696.
GETSEQ from morama, September 13, 2004 17:07.

SCORES Init1: 57 Initn: 83 Opt: 135 z-score: 75.1 E(): 1.2
>>/home/morama/gas/pili/align/orf84.pep (696 aa)

initn: 83 init1: 57 opt: 135 Z-score: 75.1 expect(): 1.2
Smith-Waterman score: 146; 24.4% identity in 262 aa overlap

(232-462:316-567)

210 220 230 240 250 

gi-50913506 . KTTSI^SVLSYSLTFELPSYTKEAVI^ 

I . i . . i . * . . . I . . i . . . 

orf 84 .pep EKEVAVDDAELKQ INSEGQQEI SVTWTNQLVTDE — KGMAYIYSVKEVDKNGELLEPKDY 

290 300 310 320 330 340 

260 270 280 290 300 . 310 

gi-50913506 - XTlSJySSVMVEm^JOiGXJ^KB^m^GFm^SFXY : -DSLESISPNISYi^WlStt?KAIVGEE 

I - I II s | ::: ::: : | J: s|s| :: : | | ||: 

orf 84. pep IKKEDGLrTVTNTYV KPTSGHYDIEVTFGWGHIDITEDTTPDI-VSGENQMKQIEGED 

350 360 370 380 390- 

» 

320 330 340 350 

gi"-50 9.1 3 50 6 . GNP— — NKAEFFYSNNP -TKGNTYDNL-DKKP-DKGNGITSKEDSKIVYTYQ 

: = l I II - I |;:J \zts |::| | i| S *|-s: :: :| : 

orf 8 4 . pep SKPIDEVTE^nsnjIEFGKNTM^^ 

400 410 420 430 440 450 

360 370 380 390 400 410 

gi-50913506. IAFRKVDSVSKTPIiIGAIFGVYX>TSS^ 

MM-: I II = = hh| |: | :t| 5 j : ||| : | 

orf 84 .pep EDSATHIKFSKRD-IDGKEI^VGATMELRDSSGKTIS — TWISD^VEI^FYIJvrPGKYTFVE 

460 470 480 490 500 510 

. 420 430 440 450' 460 470 

gi-509i3506. IiKAPKGYSLOTE-TYEIT^ 

II II * I h ^ s ||:: \tt | | | : |. || - 

orf 84 .pep TAAPIX^YE-IATAITFTVl^ 

520 .5.30 540- 550 560' 570 



. v -.490 . ,.49[0 .500 : 510; . ; : " \ 520 

g£-SO91350iL YSI|#I^G3!3p^ - *-:^-:v^:^- 

prf.84.pep DEQGfj^ 

580 590 60*6 isiO 



gi-50913506.pep

/home/morama/gas/pili/align/gi-19745307.pep

gi|19745307|ref|NP_606443.1| protein F2-like protein [Streptococcus pyogenes MGAS8232]

SCORES Init1: 57 Initn: 83 Opt: 135 z-score: 75.1 E(): 1.2
>>/home/morama/gas/pili/align/gi-19745307.pep (696 aa)

initn: 83 init1: 57 opt: 135 Z-score: 75.1 expect(): 1.2
Smith-Waterman score: 146; 24.4% identity in 262 aa overlap

(232-462:316-567)

210 220 230 240 250

gi-50913506 . KTTSlX5SVIxSYSLTFELPSYTKEAV3SrKTVYVSDN]y^ 

I • I • z I Z Z Z Z Z I * " j * * * 

gi-1 97453 07 . EKEVAVDDAELKQINSEGQQEISVTWTNQLVTDE — KGMAYIYSVKEVDKNGEI.LEPKDY 

290 300 310 320 330 340 

260 270 280 290 300 310- 

gi-50913506 . ITEDGSVMVEOTKIGIAKEVlSnsrGFlSrL,SFIY- DSLESISPNISYKAVVNNKAIVGEE 

1 :: 111: I = : | J: :|:| : | | | | : 

gi-19745307 . IKKEDGDTVTNTYV KPTSGHYDIEVTFGNGHIDITEDTTPDI-VSGENQMKQISGED 

350 360 370 380 390 

320 330 340 350 

gi-50913506 - GNP NKAEFFYSNNP TKGNT YDNX, - DKKP -DKGNG ITSKEDSKI VYT YQ 

: : I I II :s I h : I I : : : | : : | | :)::(:: : : z\ : 

gx-197453 07 . SKPIDEVTENr^IEFGKNTMPGEErX^TNSNKYEEVEDSRPVDTLSGLSSEQGQSG 

400 410 420 430 440 450 

'J 360 370 380 390 400 410 

gi-5 0913 506. IAFRRVDSV S KT PLIG AI FGVYDTSNEIIj ID I VTTJNKNG YAI STQ VS SGKYKI KE 

I I I I I II : : |:|:| |: | ::| : : : | | | J 

gx-19745307. E DS ATH I KF SKRD— I DGKE LAG ATME LRD S SGKT I S TWXSDGQVKDFYLMPGKYTFVE 

460 470 . 480 490 500 510 

420 430 440 450 460 470 

gi-50913506 . LKAPEGYSUXTTE-TYEITAN-WOT^ 

II II = I l=. * II-. |:: :: | | | :| || 

gi-19745307 . TAAPDGYEIATAITFTVNEQGQVTVN^ 

520 530 540 550 560 570 

. 480 490 500 510 • 520 530 

-gi-50913506. YSIDSRPTGIsTOVKEAYIESTKAIjTIXS 

gi-19745307 . DEQGHSGSTTEIEDSKSSDVIIGGQGQIVETTEOT 

580 590 600 610 620 630 

gi-50913506.pep

/home/morama/gas/pili/align/gi-21909640.pep

gi|21909640|ref|NP_663908.1| protein F2 like fibronectin-binding protein [Streptococcus pyogenes MGAS315]




... * 23.0 220 - 23.0 240 250 

gi-:509l3506 . KTTSLGSV^SYSLTFELPSYTK& -: _ 

I * 'J 3 2 j Z I z z z I z z . J z z • I 

gi -21909640 EKEVAVDDAELKQIJSTS^ 

180 190 200 2i0 220 

260 270 • 280 • . 290 300 310 

gi-50913506 . ITElX5SVMVENTKIGIAKEVlMGFlsni»SFIY- DSLESISPNISYKAVVNNKAIVGEE 

| :: :: I J| : | 1 : ; zzz : | | : : j : j : J | || : 

gi-2 1909640. IKKEDGLTVTNTYV KPTSGHYDIEVTFGNGHIDITEDTTPDI -VSGENQMKQIEGED 

230 240 250 260 270 280 

320 330 340 350 

gx-50913506 . GNP NKAEFFYSNNP TKGNTYDNL-DKKP-DKGNGITSKEDSKIVYTYO 

. _ :z l I II ^- I h:| h--- |::| | tlisjx-. :': :| : 

gi-21909640 . SKPIDKVTENlSn^ 

290 300 310 320 330 340 

360 370 380 390 400 410 

gi-50913506 . lAFRKVDSVSKTPLYTGAIFGVYDTSI^ 

• o.ono^n MM— Ml hh| h I ::| s s = ||| : | 

gx-21909640 . EDSATHIKFSKRD-IDGKEtxAGATMELRDSSGKTIS TWISDGQVKDFYLMPGKYTFVK 

350 360 370 380 390 400 

420 430 440 450 460 470 
gx-50913506 . LKAPKGYSLNTE-TYEITAN-WVTATWTSANSK^ 

II II : I h ss : ||:: |:= :: ::: | | | z \ || 

gx-21909640 . T AAPDGY WATAI TFTVNEQGQ VTVNGKkT^ TKG SGQ V 

410 * 420 430 440 450 

480 490 500 510 520 530 

gi-50913506 • YSIDSRPTGITOVKEAYIESTKAIjTIX^ 

. ^ H : I = II : h : • . |:| |». |: :|: | :|| 

gx-21909 64a< — IDIEEKLPD-EQGHSGSTTEIEDSKSSDVII^ 

460 470 480 490 500 

540 550 
gi-50913506, TYLFKAIGSAAMXGAIGIYIVKRRKA 

| : : :| ::||: | 

gi-219[£9 640 , TEIEDSKSSDVIIGGQGEVVDTTEL^TQSGMTGHSGSTTKIEDSKSSDVIVGC^GQIVETT 

510 520 • 530 540 ' 550 560 

gi-50913506.pep

/home/morama/gas/pili/align/gi-28810263.pep

gi|28810263|dbj|BAC63201.1| protein F2-like protein [Streptococcus pyogenes SSI-

SCORES Init1: 56 Initn: 82 Opt: 134 z-score: 74.7 E(): 1.3
>>/home/morama/gas/pili/align/gi-28810263.pep (733 aa)

initn: 82 init1: 56 opt: 134 Z-score: 74.7 expect(): 1.3
Smith-Waterman score: 155; 23.6% identity in 347 aa overlap

(232-547:316-638)




,310 320 336 ; 340 



■ . . 260 • 2-70 -280 *• \r ;• "2-90 -.*•*■»--■ ^OQ ' , ^^S ,, '3i(y ,j, * , * ,s> -' , ''**' , »-^*'* 

gi-50913506. XTT&DGSVMVEI^KI^^ 

. - I " I 11= I «i =• . I |: : | : | ^ M l | | : * 

gx-28810263, IKKEDGLTVTNTYV KPTSGHYDIEVTFGITGHIDITEDTTPDI-VSGEISrQMKQIEGED 

350 360 370. 380 390 



320 330 340 350 

gi-50913 506 . GNP NKAEFFYSNNP TKGNTYDNL-DKICP-DKGNGITSKEDSKIVYTYQ 

I I II - I IM h:= h = | | = h=M * = =1 r 

gx-2 88 10 2 63 , SKPIDEVTE*miiIEFGKKriWGEEDGTN^^ 

400 410 420 430 440 . 450 

360 .370 380 390 400 410 
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gi-50913506 . --IAFRKTOSVSKTPLIGAIFGVYD^ 

MM- MM : |:|:| |: | : = M : = MM I 

gi-28810263 . EDSATHIKFSKRD- IDGKELAG ATMELRDSSGKTI S TWI SDGQVKDF YIxMPGKYTF VE 

460 470 480 490 500 510 

420 430 440 450 460 470 

gi-5 0913 506. LKAPKG Y S LNT E - T YE I TAN - WVT AT VKT S KS TT YTS DE^TK ATDNS KQ VG WLKtSTG I F 

Ml MM h : ||:: |:s :: ::: | | MM i 

gi-2 8810263 . TAAPDGYE\^ATAI TFTVISTEQGQV^WGKIATKGDAHIVMV'- DAYKPTKG SGQV — 

520 530 540 550 560 



.480 490 500 510 520 530 

gi-50913506 . YSIDSRPTGNDVKEAYIESTKALTDGTTF^ 

IM I II z \z z t |:( |: | : :|: I :|| 

— IDIEEKIjPD-EQGHSGSTTEIEDSBCSSDVIIGGQGEWDTTE — DTQSGMTGHSGST- 
570 580 . 590 600 610 620 



-gi-28810263 



^ri-50913506 
gi-28810263 



540 550 " 

TYLFKAIGSAAMIGAIGIYIVKRRKA 

M : :| I 

TKIEDSKSSDVIVGGQGQIVETTEDTQTGMHGDSGRK 

630 640 650 . 660 670 680 



gi-50913506.pep

/home/morama/gas/pili/align/orf80.pep

TRANSLATE of: orf80.seq check: 9824 from: 1 to
generated symbols 1 to: 352.
GETSEQ from morama, September 13, 2004 17:11.
SCORES Init1: 45 Initn: 69 Opt: 113 z-score: 70.8 E(): 2.1
>>/home/morama/gas/pili/align/orf80.pep (352 aa)

initn: 69 init1: 45 opt: 113 Z-score: 70.8 expect(): 2.1
Smith-Waterman score: 123; 22.8% identity in 311 aa overlap

(284-556:49-352) 



gi-50913506 
or f 80. pep. 



260 270 280 290 . 300 . 310, 

KMAISTCTEDGSVMVEOTK^ 



20 



30 



40 



-jk&ft 

.80 ' : "'96 ' V "too 



l\- 1 : I:. 



.1.4 I 



50 60 ; . ; .70 . 

: ;>r340 v . ■ .... , .:3S0v# •^3^^ , ^S?fc:a : v c 379 



.. .380 , 



0i-5D9135O6 



orf 80. pep 



... -'390 . • ■* v-V V 4OO-4-- ^«s£^ ; 4l6- 4^?W'^-*'420;'. 

LIGAIFGVYDTSNKM f~YAISTQ-VS£fGK^Il^'r2KAPKGYSl^ 

KDGI TYDDKKWTVDVYVGNKAI^ 

140 150 160 170 .180 :, ,190 



gi-50913506 



orf 80 .pep 



.430 * 440 450 460 
TETYEITANW-- VTATVKTSANSKSTTYTSDKNKATDNSEQVG WLKNGI 

I - ! ! = : I h : : :* = :: :j | :J 

lEK-QITGNAGDRKKSFOTTLTI^PSEYYKTGSV^ 

200 210 220 230 240 250 



470 



480 



490 



500 



510 
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gi-50913506. FYSIDSRPTGNDV KEAYI ESTKALTDGTTFSKSNEGSGTVLLETDI ' 

::: I I - h = l h I hi :|- - I : 

or f 8 0 . pep SVMLSKXPIGINYYLSEDEAJSTKXK3YTTTA 

260 270 280 290 300 310 

520 530- 540 550 

gi-50913506. POTKI^ELPSTGSIGTYT.FKAIGSAT^^GAIGIYIViaiRKA 

I = -t II : i I .hh II- Nhlhl! 

orf 8 0 . pep TNKRDTQ VP -TG WGTLAPF AVLS IVAIGGV- 1 YITKRKKA 

320 330 340 350 

gi-50913506.pep

/home/morama/gas/pili/align/gi-19224137.pep

gi|19224137|gb|AAL86408.1|AF447492_5 EftLSL.A [Streptococcus pyogenes]

SCORES Init1: 45 Initn: 69 Opt: 109 z-score: 69.8 E(): 2.4
>>/home/morama/gas/pili/align/gi-19224137.pep (342 aa)

initn: 69 init1: 45 opt: 109 Z-score: 69.8 expect(): 2.4
Smith-Waterman score: 169 ; 26.0% identity in 334 aa overlap 

(257-556:25-342) 

230 240 250 260 270 280 

gi-50913506 . mTVYVSDMtfSEGIiTFN^ 

I : : : : | : : : | | : : | j | 
gi-19224137 . MK30SrRXLI,AT^^ 

10 20 30 40 

-* . 290 300 310 . 320 330 340 

gi-50913506 . YDSLESISPNISYKAVVN-NKAITO^ 

gi-19224137 . DDTIi— — LMPKTDYTFSVNPDSAATGTESN 

50 60 70 80 90 100 

350 . 360 370 380 390 400 

gi-50913506. . ITSKEDSKIVYTYQI AFRKVDSVSKTPIjIGAIFGrV' - YDTSNKLIDIVTrTNKNGYAI STQV 

= ; I — * I I - - I h II II * S hh : h: =: . * 

gi-19224137. WV-DFMKVTFPSVGIYRYVVTENK- GTAEGVTYDDTKmVDVYVGNNEKGGLEPKY 

110 120 130 140 150 160 



* „, . / 41*0; 420 , .430 . 440 . 450 

giv-509i3506.. .ESGmi&F&Bl^^ 

210 iio 



. .460 . 470 !, ; £80 490 

gi-50913506 .. - - — EQ VGWLKIT- — GIFYSI- — DSR— — -— PTGND — VKEAYIE — : — • STKALTDG 

.,. J ; -j ,| , : •-^ .j j | hi : hll - ' I H II 

gi-19224137 -WIEENGQTi^VKi^ 

- " : 230 .240 : 250 r 260 270 "280 

*'-*.-"".**- ". - * 

. 500 . . 5i0 : 520 ' 530 . 540 550 

gi-50913 506 . TTFSKSNEG- SGTvIiIiETDIPOTKI^EIiPSTGS 

: | J j : ': | : |: : : : | j j : | | |: I : I I s : 1 I I : I 

gi-19224137 . EKLSTYEIXiGQEHkTDKT^ 

290 30.0 310 320 330 

gi-50913506. KRKA 

hll 
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gi-19224137. RKKA ' 

340 

gi-50913506.pep

/home/morama/gas/pili/align/gi-19224141.pep

gi|19224141|gb|AAL86412.1|AF447492_9 protein F2 [Streptococcus pyogenes]

SCORES Init1: 73 Initn: 73 Opt: 118 z-score: 68.9 E(): 2.7
>>/home/morama/gas/pili/align/gi-19224141.pep (1161 aa)

initn: 73 init1: 73 opt: 118 Z-score: 68.9 expect(): 2.7
Smith-Waterman score: 174; 23.9% identity in 406 aa overlap 

(115-483:277-645) 

90 100 110 120 130 140 

01-50913506 . VS I/F ETG P TS Q E I TT I AUG 1NTGKI KPF STENVS I SNGTAT YNARG AS VY — IALLTGAT 

I hill I H ::: : : :| ] 

01-19224141 . IYTFTDYIAGLDKVQLSAELSDFLFJSTKE 

250 260 270 280 290 300 

. ■ 

150 160 170 180 190 
gi-50913506 . DGRTYNPIIiLAASYNGEGigXiV^^ T 

-•■II : I I h- - I I :: ::::| | :: | | 

01-19224141 . STKESNYITNGLS1WG-GSIESYNTETGEFV 

310 320 330 340 350 360 

200 210 220 230 240 250 

gi-50913 50 6 . ID— -DVNKKTTSLGSVIiSYSLTF — ELPSYTKEAVl^TVWSDl^ISEGLTFNFNSLTVE 

I 10 : = = II : | : :||| hi : :| :: || :|: 
gi -19224141 . SDLEiroAJTOSSAELGEIQVYEOT 

370 380 390 400 410 

260 270 . 280 290 300 310 

gi-50913506 i WKGKMANITEDG S VMVEISTTKIG I AXE VNNGFlSniiS F I YDSLE S I S PNI S YKA\A7NNKAI VG 

h= H':- I J. I HI : : : I | :| :•: ::: 

01-19224141 . MTKRQRIDFG IvINIQNKAF I IKV- -TGKTDQ SGKPLWQ SNIiAS 

420 430 440 450 460 

320 330 340 350. 360 
gi -.5 0 9 13 5 06 :. - 1SEGN'PNKAEFFYSIsnsrpa?KGNTY- -DWIiDKKPDKG^GlTSKEDSKIVYTY — Q^AF 

: I I 1 H.= I h i HH h i = • I H.I h i 

m-l:9224141i Fi^SEYAAF- : *PVG€HW^Q^ 

\ 480 490 500 



^iH509i3,5p6. ~Bmt>m ; 

• = I 3 I* ? — II I T ' : : : : : : : | z \ : ' : : : j p | z \ Yl \ \ I I "s - 

.01-19224141 . JO^-STDITCTI,^^ 

520 530 . 540 • 550 560 570 - 



• 4t. • t. .. 



430 440 450 460 

0i-.5O9135O6 . IiNTETYEIT- ANWVT — AO^TSANSKSTT^SDKNKATDNSEQVGWLKN 

; ; . I I I • I = H = -HI - II I H *s'i h i = | :: 

01-19224141 . QVTEKIxATVTVDTTKPAEEMVTWGS^ 

580 590 600 610 620 630 

470 480 490 500 510 520 

0i-5O9135O6 . G1FYS IT)SRPTGNDVKE AYIESTKAIiTDGTTFSKSNEGSGTVLIiETDI PNTKliGELPSTG 

IHh H: 

01-19224141 . RPDQRPAKIQVQLLQNGQKMPN^ 

640 650 660 670 680 
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gi-50913506. pep 

/home/morama/gas/pili/align/gi-21909636.pep

gi|21909636|ref|NP_663904.1| conserved hypothetical protein [Streptococcus pyogenes MGAS315]

SCORES Init1: 45 Initn: 98 Opt: 96 z-score: 66.1 E(): 3.8

>>/home/morama/gas/pili/align/gi-21909636.pep (344 aa)

initn: 98 init1: 45 opt: 96 Z-score: 66.1 expect(): 3.8
Smith-Waterman score: 181; 25.3% identity in 312 aa overlap 

(298-556:44-344) 

270 280 290 300 310 320 

gi-50913506 . TSPTKIGIAKKVNNGFNIjS ' 

:::::}::::: j : J : : : : 

gi-21909 63 6 . ATAIiGTASIiNQKVKAETAGVSEISrAKIjIvKK 

20 30 4Q 50 60 ' 70 

330 340 350 360 
gi-50913506, KGNTYDNLDKKPDKGNGIT SKEDSKI VYT YQ I AFRKV — - DSVSK 

= hl hh II l-IM =1 llh I zz I I! :||: 

gi-21909 63 6 . SGKTE33GIiEIKPGIVNGI/TEQIISYTOT^ 

80 90 .100 110 120 

... ; 370 380 390 4'Q0 410 420 

gi -509 13506. TPLI GAI FGV- YDTSNKL IDI VTTlSfKNGYAI STQVS SGKYKIKElrKAPKGY — SLNTETY 

I = h III- =1- = I 1.3 I 'iz : :[•: ::| | .: : |: | : 
gi-21909636. KQ — GDVEGITYDTKKtmmVWGNK^ 

130 140 . 150 160 170 180 

430 440 450 460 
gi-50913506 . EI TA!NWVTAT\?KTS AN SKSTTYTSDK NKATDNSE-QVGW — : LKNGI 

- I I : : s.Hh: s|s s|ssss I ~l III! 

gi-2 190 9 63 6 . K^KWSGNTGELQKEFDFTLTIiN^ 

190 200 210 220 230 240 

• * 

470. 480 490 .500 510 
gi-50913506 . FYSIDSRPTGISTOVKEAYIESTKT^ ETD- 

. hi I = hH || : | :||:| : = 11 

gi-21909636. SIQIiDKLPVGITYKVNEMEANK : DGYKTTASIi^GDGQSK3^QIibMEQKTDESADEIV" 

250 260 / 27.0 ■•■ V 280 :' 290 r ;: ^fb / 

• ; :.. . - . ;/.5'2p-. . .'. 530 . , : . u B'M v - VsKft- . .'•/••.:' 

gi-509l3SP6. Ipe^ ." : .-.V v V .1 

gi-2 19.0 9 63 6, WJmtDTQVP^TGWGOPifcAPFA^ 

310 • 320 330 340 : 

* ■ * 

... 

gi-50913506.pep

/home/morama/gas/pili/align/gi-28810259.pep

gi|28810259|dbj|BAC63197.1| hypothetical protein [Streptococcus pyogenes SSI-1]

SCORES Init1: 45 Initn: 98 Opt: 96 z-score: 66.0 E(): 3.8
>>/home/morama/gas/pili/align/gi-28810259.pep (349 aa)

initn: 98 init1: 45 opt: 96 Z-score: 66.0 expect(): 3.8
Smith-Waterman score: 181; 25.3% identity in 312 aa overlap 
(298-556:49-349) 
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gi-50913506. 



: gi-28810259. 



?70 280 290 300 310 320 

NTKIGIAKFOT^GFEnbSFIYDSI^ 

: : : : z j z z z : : j r | : : : ; 

ATAIGTASLNQIWRAETAGV SENAKtilVKKTFDSYTDISrEVIiMPKADYTFKVE ADSTA 

20 30 40 50 60 70 



gi-50913506. 
gi-28810259. 



330 340 350 360 

KGNTYDNLDKKPDKGNGIT SKEDSKI VYT YQ IAFRKV DSVSK 

hh II | | S | :| ||h I is Ml :|h 

SGKTKIXSriEIKPGirraGkTEQIISYT^ 

SO 90 100 110 120 130 



370 380 390 400 410 420 

gi-50913506 . TPIjIGAIFGV— YIDTSMKIj IDI VTTNKNGYAI STQ VSSGKYKIKELKAPKGY — SUSJTETY 

I - h Ilh= sh - Ihl " : : | : =:| | :: |: | z 
gi— 28810259 . KQ — GDVEGITYIXTKKWTVDVYVGNKEGGGFEPKFIVSKE 

140 150 160 170 180 190 



gi^50913506 
gi-28810259 



430 440 450 460 
EITANWVTATVKTSAM — : SKSTTYTSDK NKATDNSE-QVGW LKNGI 

85 I I. = = :|: :{:=:= J ::| \\\\ 

K^/KKJSTVSGOTGEIiQKEFDFTLTI^ 

200 . 210 220 230 240 250 



470 480 490 500 510 

gi-50913506. FYSIDSRPTGI^VKEAYIESTKALTIXSTTFSKS-NEGSGTVLL — ETD - 

|:| | J J : | x|hl 5 :|j 

gx-28810259 . SIQLDKIiPVGITYKWIEMKAISrK— ■ : — DGYKTTASIiKEGDGQSKMYQIjDMEQKTDESADEIV 

260 . 270 280 290 300 

52.0 530- 540 550 

gi-50913506 . IPNTKLGELPSTGSIGTYIjFKAIGSAAI^GAIGIYIVKRRKA 

2 I = -I il =11 h I = I!:-' Iihl|:|| 

gi-28810259 . VTNKRDTQ VP -TGVVGTI^PFAVL 

310 320 330 340 



gi-50913506.pep

/home/morama/gas/pili/align/gas15.pep
GAS15 GAS15



3v 



Initl : , 42 . . ' Inifcix: . 68 



SCORES 



score 
(2 83 ^535 : 2^2 r-4#0 ) 



Qp&l . 2 r spore:;j,63.-l3^ E(i: 5 ■:, f :-\" 



gi-50913506 
gaslS.pep - 



260 - 270 . 280 290 

GKMANITETCSVMtfE^Ki^ 

. ||s: j:Ih: | | V: 



200 



210 



220 



230 



2'40 



250 



gi-50913506 
gas 15 .pep 



310 320 330 340 350 * 360 

AIVGEEGNPNKAEFFYSmP^ — V 

= 1 I • s I ? hi s I* = II I : s : : | : : : | | : 

SIFESEDKGDKYZSTKGYQNLLSGGLVPT KPPTPGDPPMPPNQPQ TTSVXilRKYAI 

260 270 280 290 300 
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gi-50913506 . DSVSKTPLIGAI FGVY-DTSNKL- IDIVTTNKNGYAI STQVSSGKYKIKELKAPKGYSUST 

: II I 11 = : I : I * : = ^1 I I - = hl I : Ih = I |||: 

gasl5 .pep GDYSKL-LEGATIiQLTGDNVNSFQARVFSSjNDIGERI — ELSDGTY TLTELNS P AG YS I A 

310 320 330 340 350 360 

430 440 450 460 470 
gi-50913506 _ TE-TYEXTANWVTATV KTSAWSKSTTYTSDKNKATDNSEQVGWLKN GIFYSIDSR 

h« h I = = I i « = H — h - I = =11 =: 

gas 15. pep EPITFK^/EAGK^TYTIIDGKQIENPIS^ 

370 380 390 400 410 420 

480 490 500 510 520 530 

gi-50913506. PTGI^VKEAYIESTKALTDGTTF^ 

„ = * |* .1 . {::): h 1* II s | | ||| 

gasl5 • pep NGSSQWYCFNADLKSPPD SEDGGKTMT PDFTTGEVKYTHIAGRDLFKY 

430 440 450 460 470 

540 550 
gi-50913506. 1GSAAMIGAIGIYIVKRRKA 

- 

gas 15 .pep TVKPRDTDPDTFLKHIKKVIEKG^ 

480 * 490 500 510 520 530 

gi-50913506.pep

/home/morama/gas/pili/align/gi-13621428.pep

gi|13621428|gb|AAK33238.1| hypothetical protein [Streptococcus pyogenes]

SCORES Init1: 41 Initn: 41 Opt: 87 z-score: 63.6 E(): 5.2
>>/home/morama/gas/pili/align/gi-13621428.pep (340 aa)

initn: 41 init1: 41 opt: 87 Z-score: 63.6 expect(): 5.2
Smith-Waterman score: 109; 22.6% identity in 345 aa overlap 

(256-556:17-340) 

230 240 250 260 270 280 
gi-50913506 . VNKTVWSDNMSEGLTFN 

! i . I ::| — h "I 1| :: 

gx-13621428. MECLRHXiIiliTGA^ — WtiDl* 

10 20 30 40 

• • • 

, 290. ; 300 310 : .320.. . 330 . 340 ' V.\ V ' . : '\ . \. \ 



""if*'.-. . . \v -i a - 



. • „ ;350 ; • • - ■ :'370 ■: * : 

gi-50913506. TSI^D-SKI 



v '«5 



gi"~13621428. 1PAEFDFSEWFEKP<3VYYYKVTEEK2D 

- • - lbo -no ' 120 ■ '130 iio"r •'•■■"*• "iSV:"^\ ? W^?<'S?- T« 



• 380 390 400. 410 .420 4^0 ' ./ 

gi-50913506. GVYDTSJJKXIDr^T-l^KNGYAISTQVS^SG]^^ 

I * I : I = s 1 1 | : | : |::r " | : | : : : | : ; : | 

gi-13 62142 8 . GYKEGSKVPIQFKNSLDSTTLTVKK3CVSGTGGDRSKDFNF — GLTLKANQYYKASEKVMI 

1^0 170 180 190 200 210 

440 450 460 470 . 480 490 

gi-5091350 6 . TVKTSANSKSTTYTSDKNKATDNSEQVGWLKNGIFYSrDSRPTGND — VKEAYIESTKAL 

H : :::: - 1 -h : : : I : I ] | | :| | 
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gi -13 621428 . E -KTTKGGQ APVQT EASIDQIiYHFTLIOCk^ES IKVT1STLPVGVDYWTEDDYKSEKYT 

220 230 240 250 1 260 

500 510 520 530 . 540 

gi-50913 506 . T DGTT FSKSNEGSG'I^/LLETDIPISrTKLGELPSTGSIGTYLFKAIGSAA 

1 lh: 1=11 : = 1 -I hi: :: is : |:| z\ 

gi-13621428 . TNVEVSPQIXSAV^IAGNSTEQETSTDKD^ 

270 280 290 300 310 320 

550 

gi-50913 506- MIGAIG I YXVKKRKA 

: ||: |:|J::«I 
gi-13621428. VGGAL— YFVKKKNA 

330 340 

* ♦ 

■ 

gi-50913506.pep

/home/morama/gas/pili/align/gi-19224135.pep

gi|19224135|gb|AAL86406.1|AF447492_3 Cpa [Streptococcus pyogenes]

SCORES Init1: 41 Initn: 41 Opt: 86 z-score: 61.0 E(): 7
>>/home/morama/gas/pili/align/gi-19224135.pep (756 aa)

initn: 41 init1: 41 opt: 86 Z-score: 61.0 expect(): 7
Smith-Waterman score: 101; 19.6% identity in 306 aa overlap

(243-535:193-462) 

.. .. .220 . 230 240 250 . 260 270 

gi-50913506 . SLTFELPSYTKEAVI^TVYVSDIS^ 

I « . . .11.. | . . I . 

Cfi -19224135 . PKNAN^GY^KIEPI^AIIiWQQAVWYYSDSSYGNIKTIiWASEh KDGKIDFEQVKL- 

170 180 190 200 210 

280 290 300 310 320 330 

gi-50913 506 . lAKEVNHGFNLfSFIYDSLES I S PNIS YKAVVNNKAIVGEEGNPN — KAEFFYSNNPTKGN 

-h • -I hi! I I 1 : I :|h = h h 

,gi.-19224135 . -MREAYS KLISDDLEETSKI^hPQGSKXiNIFVPQDKSVQI^hSAEYVPESPPAPGQ 

220 ' 230 240 250 260 270 

340 350 360 370 \ 380 

.gi-50913506,. TYDI^DJO£BDKGNGITSK3^S - 




:m 

gi-19224135 ; /^QSNG^E^-^ELS^ 

. .320., 330 . .340 r - V^^ 



450 460 470 ' 480 - 490 

gi-50913506, r-* YTSDKNKATDNSEQVG- — m.KNGiFYSXpsj^^ * 

!•'-"•-' ' • •. . I v . . I \^\ . '. . . . •-■ I •* * . I „" 

<drx-19224135 . EVAEPYSV^YSDMQDSI^N^ ADLHSP 

380 * 390 400 410 420 430 

500 - 510 520 . 530 540- 550 

gi-50913506 . SKSNEGSGTVLIjETDIPNTK^ 

:h = hlh = II s I h I ' h ill 

gi-19224135. PESEDGGGT1 — DPDISTMK — EVKYTHTAGSDLFKYALRPOTTNPEDFLKHIKKVIEKG 

440 450 460 470 * 480 
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gi -19224135 . YNKKGDSYNGLTETQFRAATQLAIYYFTDSTDLKTLKTYNNGKGYHGFESMDEKTIAVTK 

490 • 500 510 520 530 540 



gi-50913506.pep

/home/morama/gas/pili/align/gi-50913505.pep

gi|50913505|ref|YP_059477.1| Collagen adhesion protein [Streptococcus pyogenes M

SCORES Init1: 70 Initn: 149 Opt: 86 z-score: 60.1 E(): 7.8
>>/home/morama/gas/pili/align/gi-50913505.pep (1036 aa)

initn: 149 init1: 70 opt: 86 Z-score: 60.1 expect(): 7.8

Smith-Waterman score: 120; 21.5% identity in 469 aa overlap

(33-428:503-966)



10 20 30 40 50 60 
gx-50913506 . NimETVREK^LITAKKLMLACL^ 

gi-50913505. YSQWVYHGTFDYSSYQGESFNRGQIHVHYRGIVSVSDGIRRD I 

480 490 500 510 520 

70 ' .80 90 loo 110 

cri-50913506. Y^GEGVYMTNGDSFINFK YAEGVSLTETGPTSQEIT-TIANGINTGKIKPFSTEN 

CTi-50913505 . ^RFVNINPENKXSVIGFQGSADYHAGKWYPDQSPRGGFYQPNIJ^ 

530 540 550 560 570 580 

• C n Q ^„n. 120 136 140 150 160 

cri-50913506. VSISNGTATYHARGiVSVY.Iftl.LTGAT DGRTYNPIMAAS. YNGEGNI.VTK 

: I : : : J : : : | | | ; III | . 1 | I . 

g-x-50913 505 . i«ram«MmiHonmajui^^ 

590 600 610 620 630 640 

170 180 190 200 210 

gx-50913506 . NXDS— KSNYLYGQ— TSVA KSSLPSITKKVTGTIDDVMKKTTSLGSVLSYSDTFE 

I : I : " I = I :::| j : | : : : | : I : I : I - | - I - • I - I 

gi-50913505 . NGSSHDRNJWTRSQEGSKLAIDEFKARYPNIiSiYSIiGVSKDINSDTASSSP 

. 660 670 680 690 700 



-220 . 230 




• IVJL f' 



• • -in 

« a 



, . 240 250 ... . 260 

• . 750 ■ : ..' i£b • ' " 



\__ ' : . : . . . - v 270 , 280 290 
STi 50913506. — : r^-^^ 

gi~5O913505 ; ILYQKI3QVQEiU3K3}II 

820 




800 



810 



^ . 300 ',..310 320 ."330 340 350 • 
cji- 509 13 5 0 6 , SYKAVVKNKAIVGEEGNPK^VEFFYS 

: I : - | : - :| |: : : : : | | : : I I - • • - f - | | 

giL-50913505 . AYKKYKDNEGKYSEM^ 

830 840 850 860 870 880 



360 



370 



380 390 400 

cp--50913506 . IVYTYQIAFRI^S^^ 

M 1 = 1 III: | |: | - ::J:= : ::: .J j j 



FIGURE 58M 




WO 2006/078318

gi-50913505 . WKTVPITFOTEVDAI^Q^ 

8 - u 900 Q1n 920 930 940 



PCT/US2005/027239



900 



910 



410 



420 



430 440 450 460 

cri-50913506 . IKEL^PKGYSIOTETYEITANWVTATVKTSANSKSTTYTSDKNKATDWSEQVGWLKNGI 

gi-50913505 . LYETKA^I/SYTLiPEISrPWE VAVANNGD IKVEUP IEGELKSKDG S YMI 

950 960 970 980 



990 



1000 



gi-50913506.pep

/home/morama/gas/pili/align/gi-13621430.pep
gi|13621430|gb|AAK33240.1| hypothetical protein [Streptococcus pyogenes]

SCORES Init1: 43 Initn: 67 Opt: 67 z-score: 59.2 E(): 8.7
>>/home/morama/gas/pili/align/gi-13621430.pep (215 aa)

initn: 67 init1: 43 opt: 67 Z-score: 59.2 expect(): 8.7

Smith-Waterman score: 67; 27.8% identity in 90 aa overlap

/home/morama/gas/pili/align/gi-19745303.pep

gi|19745303|ref|NP_606439.1| hypothetical protein [Streptococcus pyogenes MGAS82

SCORES Init1: 51 Initn: 106 Opt: 69 z-score: 58.4 E(): 9.5

>>/home/morama/gas/pili/align/gi-19745303.pep (344 aa)

initn: 106 init1: 51 opt: 69 Z-score: 58.4 expect(): 9.5

Smith-Waterman score: 129; 24.0% identity in 308 aa overlap

(298-556:44-344)

! CPU time used:

! Database scan: 0:00:00.1

! Post-scan processing: 0:00:01.9
! Total CPU time: 0:00:02.0

! Output File: gi-50913506.fasta

(Peptide) FASTA of: gi-13621430.pep from: 1 to: 215 September 15, 2004 18:45

gi|13621430|gb|AAK33240.1| hypothetical protein [Streptococcus pyogenes]

TO: *.pep Sequences: 56 Symbols: 22,803 Word Size: 2

Scoring matrix: GenRunData:blosum50.cmp
Variable pamfactor used

Gap creation penalty: 12 Gap extension penalty: 2



Histogram Key:
Each histogram symbol represents 1 search set sequences
z-scores computed from opt scores
Joining threshold: 36, opt. threshold: 24, opt. width: 16, reg.-scaled



The best scores are:                                init1  initn    opt   z-sc    E(55)

/home/morama/gas/pili/align/gi-13621430.pep  Begin: 1 End: 215
! gi|13621430|gb|AAK33240.1| hypothet...             1338   1338   1338  233.9  1.8e-09
/home/morama/gas/pili/align/gi-19745305.pep  Begin: 1 End: 193
! gi|19745305|ref|NP_606441.1| hypoth...              163    243    273   82.2      0.5
/home/morama/gas/pili/align/gi-28810261.pep  Begin: 2 End: 187
! gi|28810261|dbj|BAC63199.1| hypothe...              164    239    268   81.5     0.55
/home/morama/gas/pili/align/gi-19224139.pep  Begin: 2 End: 187
! gi|19224139|gb|AAL86410.1|AF447492_...              164    236    265   81.0     0.57
/home/morama/gas/pili/align/orf82.pep  Begin: 30 End: 222
! TRANSLATE of: orf82.seq check: 4296...              163    235    264   81.0     0.58
/home/morama/gas/pili/align/gi-21909638.pep  Begin: 2 End: 181
! gi|21909638|ref|NP_663906.1| hypoth...              164    239    261   80.5     0.62
/home/morama/gas/pili/align/gi-19745303.pep  Begin: 84 End: 183
! gi|19745303|ref|NP_606439.1| hypoth...              121    121    126   61.4      6.7
/home/morama/gas/pili/align/gi-13621428.pep  Begin: 6 End: 174
! gi|13621428|gb|AAK33238.1| hypothet...               58     86    122   60.9      7.2
/home/morama/gas/pili/align/gi-19224137.pep  Begin: 93 End: 201
! gi|19224137|gb|AAL86408.1|AF447492_...               88     88    119   60.4      7.5
/home/morama/gas/pili/align/gi-50913503.pep  Begin: 549 End: 625
! gi|50913503|ref|YP_059475.1| Fibron...               73     73    117   60.4      7.6
/home/morama/gas/pili/align/gi-19224134.pep  Begin: 631 End: 697
! gi|19224134|gb|AAL86405.1|AF447492_...               73     73    115   60.1      7.8
\\End of List



gi-13621430.pep
/home/morama/gas/pili/align/gi-13621430.pep

gi|13621430|gb|AAK33240.1| hypothetical protein [Streptococcus pyogenes]

SCORES Init1: 1338 Initn: 1338 Opt: 1338 z-score: 233.9 E(): 1.8e-09
>>/home/morama/gas/pili/align/gi-13621430.pep (215 aa)

initn: 1338 init1: 1338 opt: 1338 Z-score: 233.9 expect(): 1.8e-09
Smith-Waterman score: 1338; 100.0% identity in 215 aa overlap
(1-215:1-215)
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gx-13 621430. LPSTGEWSYVSAI^rw J VA'ria?LYSIYKKLKTSK 

190 200 210 ' . 

• * 

gi-13621430.pep
/home/morama/gas/pili/align/gi-19745305.pep

gi|19745305|ref|NP_606441.1| hypothetical protein [Streptococcus pyogenes MGAS82

SCORES Init1: 163 Initn: 243 Opt: 273 z-score: 82.2 E(): 0.5
>>/home/morama/gas/pili/align/gi-19745305.pep (195 aa)

initn: 243 init1: 163 opt: 273 Z-score: 82.2 expect(): 0.5
Smith-Waterman score: 320; 31.9% identity in 213 aa overlap
(1-213:1-193)
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gi-13621430. pep 

/home/morama/gas/pili/align/gi-28810261.pep

gi|28810261|dbj|BAC63199.1| hypothetical protein [Streptococcus pyogenes SSI-1]

SCORES Init1: 164 Initn: 239 Opt: 268 z-score: 81.5 E(): 0.55
>>/home/morama/gas/pili/align/gi-28810261.pep (189 aa)

initn: 239 init1: 164 opt: 268 Z-score: 81.5 expect(): 0.55
Smith-Waterman score: 306; 30.6% identity in 206 aa overlap
(8-213:2-187)

10 20 30 40 50 60 

gi-13621430. MKKSILRILAIGYLLMSFCLLDSVEAENLTASINIEVINQVDVATNKQSSDIDETFMFVT
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gx-28810261 - MLF SWMI LTKLAFWQTVLAKDST VQT S I S VENVLERAGDSTP FSIAL 
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. 000 . w .= : ^ f-ll=il 111: IJM j:::| ::|::|:.| || :.: J : 

gx-28810261. ESIDAMKTIEE— — ITXAGSGKASFSPIiTFTTOGQyTyRVYQKPSQNKDYQ ADTTVFDVL „ ^11 

50 ^0 . 70 80 90 100 . ' " T 

. ■ • 
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gx -13 621430. l^^^flreQSGALETNLVSI^^ 

= 14- I - I = hi ?::.:| : h III s II : = I | : | | ; |: 
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12,0 . 13t> ' 140 : 150 . ' '. 



' . 190 200 210 
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II HI = I IHU- s.sT IMh 
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160 170 180 ; 

gi-13621430.pep

/home/morama/gas/pili/align/gi-19224139.pep

gi|19224139|gb|AAL86410.1|AF447492_7 unknown [Streptococcus pyogenes]

SCORES Init1: 164 Initn: 236 Opt: 265 z-score: 81.0 E(): 0.57
>>/home/morama/gas/pili/align/gi-19224139.pep (189 aa)

initn: 236 init1: 164 opt: 265 Z-score: 81.0 expect(): 0.57
Smith-Waterman score: 303; 30.6% identity in 206 aa overlap

(8-213:2-187) 
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gi-13 62143 0 . MKKSILRIIiAIGYLLMSFCLLpSVI^^^ 
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gi^-13621430.pep 4 

/home/morama/gas/pili/align/orf82.pep

TRANSLATE of: orf82.seq check: 4296 from: 1 to: 672
generated symbols 1 to: 224.
GETSEQ from morama, September 13, 2004 17:09.
SCORES Init1: 163 Initn: 235 Opt: 264 z-score: 81.0 E(): 0.58
>>/home/morama/gas/pili/align/orf82.pep (224 aa)

initn: 235 init1: 163 opt: 264 Z-score: 81.0 expect(): 0.58

Smith-Waterman score: 304; 30.5% identity in 213 aa overlap
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.;9a^iAp^Jt^-u -pep 

. . • ...... '.*"•'• 

gi|21909638|ref|NP_663906.1| hypothetical protein [Streptococcus pyogenes MGAS31

SCORES Init1: 164 Initn: 239 Opt: 261 z-score: 80.5 E(): 0.62
>>/home/morama/gas/pili/align/gi-21909638.pep (183 aa)

initn: 239 init1: 164 opt: 261 Z-score: 80.5 expect(): 0.62
Smith-Waterman score: 302; 31.5% identity in 200 aa overlap

(14-213:2-181)



) 



60 



1° 20 30 40 50 

gi-13621430. MKKSILRILAIGYLLMSFCLLDSVEAENLTASINIEVINQVDVATNKQSSDIDETFMFVT

: I : • =-l I-.- h: HI [ :: J t: I ... 

, raLTMIiAFNQTVLAIODSWQTSISVElWLERAGDSTP FSIAL 



gi-21909638. 



FIGURE 59D 







I'6 20 30 40 

7 0 80 90 100 110 120 

91-13621430 . EAIiDKESPLPNSVTTSvKGNGKTSFEQI^^ 
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50 60 70 80 90 

130 140 150 160 1?0 ' 180 

gi- 13 62143 0; I Y^YiraQSGALETJSrL^^ 

, -N I H = IM = .? |r ML * M I I : | | : h 

gx-2 19 0 9 63 8 . VYVTYDED-GTL7VAKVT SRRAGDEEKSAITFKPKm, VTCPI P PRQPNI — - — PKTP ^~ 

' \ : . • 100 110. 120 13Q 140 
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• * • ■ * a " * ■ 

o * 

gi-13621430.pep

/home/morama/gas/pili/align/gi-19745303.pep

gi|19745303|ref|NP_606439.1| hypothetical protein [Streptococcus pyogenes MGAS8232]

SCORES Init1: 121 Initn: 121 Opt: 126 z-score: 61.4 E(): 6.7
>>/home/morama/gas/pili/align/gi-19745303.pep (344 aa)

initn: 121 init1: 121 opt: 126 Z-score: 61.4 expect(): 6.7
Smith-Waterman score: 126; 27.0% identity in 100 aa overlap

(59-155:84-183)

■ 

30 40 50 60 70 80 

gi-1362143 0 . LTAS INIEVlNQVDVAT^^ — NSVTTSVKGNG-KTSF 

M-J— : = M |::| :: : :| 
gi-19745303 ; LMPKADYTFKVEADDNAKGKTKIX5LDIKPGVIIX5L 

60 70 . 80 90 100 . 110 

90 100 110 12Q 130 . 140 

gi-13 621430 . EQLTFSI^GQYHYKIHQLx^^ 

• 1Q „^' zz 1 11 ,:| : :z 11 : 1 i-m mi^ r«.- 

gx-19745303.. 2tf)*VKFPWGVYRYOTSEVNG^ 

120 130 146 150 160 170 



gx~l^f 2i430 T ^E&?&KQ$YS^ 

I • • • I I • - • • 
gi-197453 03 „ KJCPVi^KNFFDTT^^ 

180 190 200 210 220 230 

* 

gi-13621430.pep
/home/morama/gas/pili/align/gi-13621428.pep

gi|13621428|gb|AAK33238.1| hypothetical protein [Streptococcus pyogenes]

SCORES Init1: 58 Initn: 86 Opt: 122 z-score: 60.9 E(): 7.2
>>/home/morama/gas/pili/align/gi-13621428.pep (340 aa)

initn: 86 init1: 58 opt: 122 Z-score: 60.9 expect(): 7.2
Smith-Waterman score: 135; 29.1% identity in 172 aa overlap 

_ 

FIGURE 59E 
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gi -13621428 . MKLRHLLM^AAI/TSFA^^ 

10 20 30 40 50 

. - . 6 Q 70 80 90. 100 
gi-13 62143.0.. -VTEALDK-- :-ESPLPN-SVTTSVKGNGKTSFEQLTFSKV GQYHYKI-H 

I I H V|: : -r I = I 1 1: : : I : MM | | : J | .: : 

gr-13621428. PDTTVfcnSDGNKFKGVAI^ 

•60-70 80 90 100 ,110 

"... . 

110 120 . 330 140 . 150 160 

gi-13621430 . Ql*U2KNSQYHYVETVYEV^ 

. J > .V I H I I 1 ::i|:M.h * I = U I t | : ||: : |- 

gx-13621428. EIOIDKVI^SYOTTSY^ 

»\. " ; * • - 140 150 .160 170 '• 

.*■•■"■- • ■ . . • . 

170 180 190 200 210 

gi~13621430. EPHQPOTTEKEKPQKiaOTGI^^ 

gi-13 621428 . TVRKKVSGTGGDRSKDFISnFGIjT^ 

180 190 200 210 220 230 

* 

gi-13621430.pep
/home/morama/gas/pili/align/gi-19224137.pep

gi|19224137|gb|AAL86408.1|AF447492_5 EftLSL.A [Streptococcus pyogenes]

SCORES Init1: 88 Initn: 88 Opt: 119 z-score: 60.4 E(): 7.5

>>/home/morama/gas/pili/align/gi-19224137.pep (342 aa)

initn: 88 init1: 88 opt: 119 Z-score: 60.4 expect(): 7.5
Smith-Waterman score: 119; 29.7% identity in 111 aa overlap
(72-176:93-201)

50 60 .70 80 90 

gi-13621430 . DVATlIKQSSDIDETFMFvTEAIiDK^ TSFEQLTFSEVGQY 

I —-- .I -II ' ts\ ::| | I I | 

gi -19224137.. Sv^DSA&TGTESI^PIKPG:^^ 

70 80 90 100 110 120 



100 HQ .120 130; 140 ; 

gi-13 621430;. 

gx-13224137> lOVVEEirafcTAEGVT^ 

130 * ±40 150 16d 170 ±60 

1^0 lia . 180 190 200 210 

gi -13621430. SEKTPEPHQPDTTEKEKPQKia^ 

I I : is| : -II 

gi-19224137. ^EOTSI^EKEOTHSNTG^^ 

190 200 210 220 230 240 

gi-13621430.pep

/home/morama/gas/pili/align/gi-50913503.pep

gi|50913503|ref|YP_059475.1| Fibronectin-binding protein [Streptococcus pyogenes MGAS10394]

FIGURE 59F

SCORES Init1: 73 Initn: 73 Opt: 117 z-score: 60.4 E(): 7.6

>>/home/morama/gas/pili/align/gi-50913503.pep (627 aa)

initn: 73 init1: 73 opt: 117 Z-score: 60.4 expect(): 7.6
Smith-Waterman score: 118; 28.7% identity in 87 aa overlap 

(129-215:549-625) 

1Q° HQ 120 . 130 140 150 
gi-13621430 . KIHQLIjGK2tfSQ"raYDEW 

gi-5 0913503 . lETEDTKEPEVI^GGQSESVEFTK^^ EDTRPKLVFf^DNNEP 

520 530 540 55.0 560 570 

16° 170 180.. 190 200 * 210 

gi-13621430 . TPEPHQPDTTEKEKPQKKI^GII.PSTGEMVSWSAI^IVLVATITLYSIYKKLKTSK 

gx-5O913503 ? KVEEM— ^KPTKNITPiriPATGDIENVI^ : 

. 580 : 59Q . $qq . - . . 61€ 620 

. * ^ - » 

* 

• ■ 

gi-13621430.pep

/home/morama/gas/pili/align/gi-19224134.pep

gi|19224134|gb|AAL86405.1|AF447492_2 protein F [Streptococcus pyogenes]

SCORES Init1: 73 Initn: 73 Opt: 115 z-score: 60.1 E(): 7.8
>>/home/morama/gas/pili/align/gi-19224134.pep (698 aa)

initn: 73 init1: 73 opt: 115 Z-score: 60.1 expect(): 7.8
Smith-Waterman score: 115; 27.4% identity in 73 aa overlap

(143-215:631-697)

120 . 130 . 140 ; 150 160 170 

gi-13621430. DETVYEWIYVLYNEQSGALETN^^ 

| : V|:|: : :| | : : || 

gx-19224134 . VLMGGQ S ES VE FTKDT QTGMSGFS ETVT I VEDTRPKLVFHFDNNE PKVEEN REK 

610 620 630 640 . 650 


180 190 200 210 

gi-13621430. PQKKRNGXDPSTGEMVSWS^^ 

I i- s |||:||tr.: :: | || :::::::: : |. : | : ::: | 
gi -19224134. PTK^ITPII^PATODIEftV^ 

660 670 680 690



! Completion time: Wed Sep 15 18:45:38 2004

! CPU time used:

! Database scan: 0:00:00.1

! Post-scan processing: 0:00:00.3
! Total CPU time: 0:00:00.4

! Output File: gi-13621430.fasta
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protein F2-like fibronectin-binding protein

Length: 696-733

LPXTG

pilin motif consensus PK(X7)K
E box consensus ETatAPxGY 



SpyM3_0104/21909640
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pilin motif 



E box 



pilin motif 



E box 
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SpyM18_0132/19745307 pilin motif 



E box 
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YTFVETAAPDGY 



orf 84 



pilin motif 



E box 
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PKEKPIIYFK 
512 

YTFVETAAPDGY 



TYPE 4 pilin motifs

protein F2-like fibronectin-binding protein
Length: 1X6 X

pilin motif consensus PK(X7-8)K

The protein is longer than the proteins belonging to TYPE 3 and has 4 possible pilin
motifs and 2 E boxes



19224141 



pilin motifs 
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E boxes 
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LEGEND:

M1 tot: total extract (M1)
M1 prot. sup.: fraction enriched in surface proteins
#: Purified recombinant proteins, 30 ng
I α-#: immune sera against #
P α-#: pre-immune sera



Pa-15 



Figure 91 
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LEGEND: 

M1 tot: total extract (M1)
M1 surf prot.: fraction enriched in surface proteins
#: Purified recombinant proteins, 30 ng
I α-#: immune sera against #
P α-#: pre-immune sera
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Figure 92 
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Figure 93 
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LEGEND:

M1 tot: total extract (M1)
M1 prot. sup.: fraction enriched in surface proteins
#: Purified recombinant proteins, 30 ng
I α-#: immune sera against #
P α-#: pre-immune sera
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Figure 94 
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LEGEND: 

M1 tot: total extract (M1)
M1 surf prot.: fraction enriched in surface proteins
#: Purified recombinant proteins, 30 ng
I α-#: immune sera against #
P α-#: pre-immune sera
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Figure 95 
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LEGEND:

M1 tot: total extract (M1)
M1 prot. sup.: fraction enriched in surface proteins
#: Purified recombinant proteins, 30 ng
I α-#: immune sera against #
P α-#: pre-immune sera
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Figure 96 
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LEGEND:

M1 tot: total extract (M1)
M1 prot. sup.: fraction enriched in surface proteins
#: Purified recombinant proteins, 30 ng
I α-#: immune sera against #
P α-#: pre-immune sera
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Figure 97 
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Figure 98 




LEGEND: 

M6 tot: total extract (M6)
M6 surf prot.: fraction enriched in surface proteins
#: Purified recombinant proteins, 30 ng
I α-#: immune sera against #
P α-#: pre-immune sera
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Figure 102

LEGEND:

I α-#: immune serum anti-#
P α-#: pre-immune serum anti-#
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M1, strain 2913
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Figure 103

LEGEND:

I α-#: immune serum anti-#
P α-#: pre-immune serum anti-#
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M1 strain 3348 
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Figure 105
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M1 strain 2719 
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Figure 106
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Western blot on fraction enriched in surface proteins of M1 (SF370)
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Western blot on fraction enriched in surface proteins of M12 (2728) 
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Note: 0102 corresponds to SpyM3_0102 in MGAS315 (M3). This protein is nearly identical
to the M12 19224139 (see also gel "WB M12 (23-11-4b2a)"), the M18 spyM18_0130 and the M5
orf 82. Here the antiserum raised against 0102 recognizes high-molecular-weight structures in M12
(strain isolate 2728). This means that in M12 three out of the five LPXTG proteins form
high-molecular-weight structures.

Figure 108 




Note: 0102 corresponds to SpyM3_0102 in MGAS315 (M3). This protein is nearly identical
to the M12 19224139 (see also gel "WB M12 (23-11-4b2bis)"), the M18 spyM18_0130 and the M5
orf 82. Here the antiserum raised against 0102 recognizes high-molecular-weight structures in
protein extracts from M12 (strain isolate 2728).
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Western blot on fraction enriched in surface proteins of M6 (2724) 
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FIGURE 114 
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FIGURE 116 
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FIGURE 117 
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FIGURE 118 
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FIGURE 121 
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FIGURE 122 




WO 2006/078318 PCT/US2005/027239 




FIGURE 123 







FIGURE 124







FIGURE 125 
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Figure 127 
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Figure 128 
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